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5 Transmission Unit Receiving and Storing Means 

[Field of invention] 

The present invention relates to a receiving and storing 
10 means for receiving and storing transmission units 

containing information and to a corresponding method. Such 
a receiving and storing means is connected to a 
transmission means for such transmission units, to thereby 
form a communication system, the transmission units having 
15 a format such that besides other data, they also contain 
data relating to properties of each individual 
transmission unit. 

[Background of invention] 

20 

Electronic message systems that transmit so called 
electronic mails (e-jnails) have become very important as a 
quick and efficient means of exchanging information. In 
such systems, one user can compose a message and send it 
25 to another user by specifying that user's address. This 

message is then translated into a format that is suitable 
for the transmission system to which both users are 
connected, and then routed through said transmission 
system to the addressee. 

30 

A well known system that supports such an electronic 
message system is the so called internet. Besides this 
global system, there also exist networks which are 
restricted to a given number of subscribers at a given 
35 location, e.g. a given number of users in one company, 
where such systems are referred to as intranets. It is 
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also possible to combine local and global message transfer 
systems, e.g. by letting the user in an intranet send 
messages to other users in the intranet, or send out 
messages to subscribers of the global net through a so 
5 called gateway, i.e. a connection point between the 
intranet and the global network. 

Such networks therefore allow communication between a 
message source and one or more message receivers. With 

10 respect to the communication between the source and the 
receiver, two questions need to be answered, namely what 
can be transferred and how can it be transferred. The 
first question refers to the syntax and the second to the 
protocol. In the presently used network communications 

15 systems described above, two major protocol standards have 
emerged, namely the so called Simple Mail Transfer 
Protocol (RFC 821) and the OSI Message Handling System 
(X.400 series) . In order to perform the transferring of 
data, both of these protocols assign so called user agents 

20 as communication front ends for any given user. Figure 6 
schematically shows an example of a message handling 
system based on these protocols. A user agent will collect 
outgoing messages and incoming messages. In order to send 
a message, the user agent communicates with a transfer 

25 agent, which in turn handles the routing of the message 
through the network to the receiver's user agent. On the 
receiving side, the message is then routed to a so called 
mailbox by a transfer agent. The messages collected in the 
mailbox can then be called up by the receiver, via his 

30 user agent. The source or sending unit A, the network B 
and the receiving unit C form a communication system. As 
indicated by the identical structure shown in Figure 6, 
both units A and C can send and receive messages. 



35 The syntax of the transmission units or messages is 

generally divided into two parts, namely a header and a 
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body. While the body carries the actual contents of the 
message, i.e. the information that the source desires to 
communicate to the receiver, the header contains the 
information relating to the message itself, e.g. the 
5 author's and receiver's address, as well as information 
concerning the time or date and the subject of the 
message. The body can consist of plain text or of binary 
encoded information representing any kind of computer data 
format. The transmission of several body parts that all 
10 belong to single message is also supported, A message that 
is divided in this way, is then reassembled at the. 
receiver's end. 

For the two protocol standards mentioned above, the syntax 
15 is specified in RFC 822 and RFC 1521/RFC 1522 for RFC 821 
and X.419 for OSI. This is disclosed in detail in the 
following documents, so that no description is given in . 
the framework of this application. 

20 Crocer, David H., RFC 822: Standard for the format of ARPA 
Internet text messages, DDN Networking Information Center, 
SRI International, August 1982. 

Borenstein N . , Freed N., RFC 1521: MIME (Multipurpose 
Internet Mail Extension) Part one: Mechanisms for 
25 specifying and describing the format of Internet message 
bodies, DDN Network Information Center, SRI International, 
September 1993. 

Moore K., RFC 1522: MIME (Multipurpose Internet Mail 
Extensions) Part two: Message header extensions for non- 
30 ASCII text, DDN Network Information Center, SRI 
International, September 1993. 

CCITT Study Group VII, Data Communication Networks: 
Message Handling Systems (MHS ) , volume VIII, 
Recommendations X.400-X.420 of the Red Book Series, 
35 International Telecommunication Union, 1984. 
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CCITT Study Group VII, Data Communication Networks: 
Message Handling Systems (MHS) , volume VIII, 
Recommendations 

X.400-X.420 of the Blue Book Series, International 
5 Telecommunication Union, 1989. 



[Prior art and problem] 



In message handling systems, of the above described type, a 

10 user agent will store newly received messages in an 

appropriate storage means. As already mentioned, these 
newly received messages consist of data relating to the 
actual contents of the message and of data relating to the 
message itself, e.g. the author, the recipient, the 

15 subject, keywords etc. . The messages are typically stored 
into a predetermined group, i.e. into a folder referred to 
as an in-box. In this folder, the messages can be sorted 
according to arbitrary criteria, e.g. in the order of 
their respective times of being received or sent, or they 

20 can be sorted according to authors, etc. . The user can 

then retrieve the messages from the in-box and go through 
them, to decide which messages to keep, which messages to 
delete, and where to store messages that he desires to 
keep. Often these actions will be highly repetitive for 

25 recurring messages, e.g. a message relating to a specific 
subject (e.g. a project) may always be stored into a 
specific folder. This action of scanning newly received 
mails can be very tedious and time consuming, especially 
if a user receives a large volume of messages. It also 

30 constitutes a burden for the storage means, as stored 
messages need to be read and re-stored. Thereby, this 
method of receiving and storing electronic messages is 
technically inefficient . 



35 To avoid this time consuming work and to achieve 

automation, systems have been proposed that allow a 
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sorting of messages in accordance with rules that the user 
determines previously. Such a system is e.g. described in 
OFFICE COMPUTING REPORT, Vol.15, No. 9, September 1992. 
Such systems are also referred to as a rule generators. 
5 The rules allow the system to automatically match specific 
mails in accordance with their properties. These rules 
typically have a when-if-then structure (when an event 
happens, and if the event meets certain conditions, then a 
specific action is taken) . An example of this is: when a 
10 new message is delivered to the in-box that carries the 
subject ^meeting"', and if it is from the supervisor, then 
the message is stored in the meeting folder. 

Such known systems have the disadvantage that the rules 
15 for allocating specific messages to specific groups 

(folders) must still be created by the user. This is not 
simple, because the user must be able to foresee which 
mails will arrive and how they are to be treated. This is 
especially difficult for users receiving a large amount of 
20 mail from various sources. Also, once a rule is 

determined, it remains unchanged and in time may no longer 
be suited to handle the changing situation relating to the 
properties of the received mails. As a consequence, the 
user must regularly check and update the applied rules. 
25 This again is a time consuming task that keeps the user 
from performing the actual work at hand, namely 
concentrating on the contents of the messages and 
responding accordingly. Otherwise, as it can not adapt, 
the system in time becomes inefficient . 



The above problems, which are encountered when dealing 
with the handling of electronic mail messages, are not 
restricted to the specifically described systems. Much 
rather, such problems will always occur in systems that 
allocate received transmission units (e-mails in the above 



30 
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described example) into specific groups in a storage 



means . 



[Object] 



5 



Consequently, the object of the present invention consists 



in providing a system and a method for automatically 
allocating transmission units into given groups, where the 
allocation itself is automatically performed and can 
10 automatically adapt to changing conditions, so that a 
highly efficient receiving and storing system can be 
achieved. 

[Brief description of invention] 

15 

This object is solved by a system and method employing a 
weighted sum with respect to the determination of 
allocation decision parameters from the comparison between 
the structure of previously stored transmission units and 
20 the newly to be stored transmission units. 

According to the invention, when receiving a new 
transmission unit (e.g. an e-mail), the system determines 
the data relating to a predetermined number of properties 

25 (e.g. author, receiver, and subject) of the received unit. 
This data or field value of a property field (e.g. the" 
name of the author indicated in the author field of the 
message) is then compared with the corresponding data 
(e.g. the names of authors) in units previously stored in 

30 a predetermined number of groups (e.g. folders) . It is 

determined how often the data related to each property of 
the newly received unit is contained in each of the groups 
under consideration (e.g. group 1; name of author of 
received message is given as author in 10 stored messages, 

35 name of receiver of received message is given as receiver 
in 52 stored messages, subject of received message is 
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given as subject in 4 stored messages; group 2: name of 
author of received message is given as author in 17 stored 
messages, name of receiver of received message is given as 
receiver in 34 stored messages, subject of received 
5 message is given as subject in 0 stored messages; etc.). 
Thereby the system determines data occurrence values for 
each property and each group (in the above example: 10 for 
property „author" in 

group 1, 52 for property „receiver" in group 1, etc.). 

10 

The system then multiplies each data occurrence value with 
a weight factor that is associated with the property and 
group. In each group, the resulting products are summed 
over the properties and divided by the number of units in 

15 the given group, to thereby generate storage decision 

values associated with each group under consideration. The 
received unit is then allocated to any one of all the 
possible groups (e.g. one of the folders under 
consideration or into a default folder) on the basis of 

20 the storage decision values. 

Due to basing the allocation decision on decision values 
that depend on the units already stored in the system, the 
present invention achieves a receiving and storing system 
25 that can all at once automatically store new units into 
groups, and automatically adapt the decision to the 
present state of the previously stored units. Due to this 
automatic adaptation, the system of the present invention 
is highly flexible and very efficient. 



30 



[Description of figures; 



Further advantages and features of the invention can be 
better understood from the following detailed description 
35 of preferred embodiments of the invention, taken together 
with the accompanying figures, in which: 
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Fig. 1 shows a general embodiment of the present 
invention; 

5 Fig. 2 shows an example of the format of transmission 
units 

in accordance with the present invention; 

Fig, 3 schematically shows how a weighted sum is 
10 calculated 

for a given group; 

Fig. 4 shows a flow chart of the method according to an 
embodiment of the present invention; 

15 

Fig. 5 shows an example of a graphical user interface 
for 

allowing a user to adjust parameters used in an 
embodiment of the present invention; and 

20 

Fig. 6 shows a communication system which routs 



messages 



from a source to a receiver via a network. 



25 [Detailed description of embodiments] 

In the following, the present invention shall be described 
by way of preferred embodiments. 



30 Fig. 1 shows a schematic outline of a general embodiment 
of the present invention. The receiving and storing means 
1 consists of a storing means 2, in which transmission 
units TRU are stored into predetermined groups 
Gi, . . . , Gi, . . . , G n . Furthermore, a data determination means 

35 3 is provided, which determines the data relating to 

specific properties of a received transmission unit, and 
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the corresponding data associated with the same specific 
properties in transmission units already stored in the 
storage means 2. 



5 Fig. 2 shows an example of a transmission unit. The format 
of the shown transmission unit TRU is such that a header 
and a. body are provided. The header consists of several 
sections or fields 10-14, each containing data related 
to a specific property of said unit TRU (e.g. author, 

10 receiver, date, etc.), i.e. field values. It should be 
noted that Fig. 2 only shows an example. The specific 
format of the transmission units is of little importance 
to the present invention, as long as they contain data 
associated with defined properties, and the association 

15 between given data and a specific property can be 

discerned. In the format shown in Fig. 2, this is assured 
by having specific sections in a determined order, where 
each section contains the data relating to a specific 
property. For example, section 10 can always contain the 

20 name of the author, section 11 the name of the receiver, 
etc. . It is however equally well possible that the data 
relating to a specific property, i.e. the field value is 
identifiable by begin and end markers, so that this data 
can be located anywhere in the transmission unit. 

25 

The system shown in Fig. 1 furthermore comprises a 
comparison means 4, which compares the data from the 
incoming transmission unit with the data from the 
transmission units that are already stored. A comparison 

30 evaluation means 5 counts the number. -of times that a given 
information (e.g. a name) relating to one of a selected 
number of properties (e.g. author) of the incoming unit is 
contained in the property sections associated with said 
one property in the units stored in a certain number of 

35 groups, i.e. an occurrence value is determined. Said 
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counting is performed for each of the selected number of 
properties and for each of the certain number of groups. 

A calculation means 6 is provided for using the values 

5 counted by said comparison evaluation means, as a basis 
for calculating a weighted sum for each of said certain 
number of groups. Said weighted sum is calculated from the 
products of said occurrence values with multiplication or 
weight factors, each weight factor also depending on said 

10 property and said group, just as said occurrence value. An 
example of this is shown in Fig. 3, where the selected 
properties are "author", "receiver", "subject", "Cc", 
"keywords" and "local" (this example will be discussed in 
more detail later) . For example, the data contained in the 

15 property section associated with "author" in the received 
transmission unit (e.g. "Miller") is contained On times 
in the property sections associated with "author" in the 
units stored in group i. This value is then multiplied 
with the weight factor W u and added to the sum of other 

20 products consisting of the occurrence value Oji and the 
weight factor Wji that depend on the property j and the 



ni of units contained in the group i. The resulting value 
Si is a measure of how much the received transmission unit 
25 and the group under consideration have in common. The 

larger this value, the more the new transmission unit and 
the units in the considered group have in common. 

Finally, a storage decision means 7 controls the storing 
30 or sorting of the newly received transmission unit on the 
basis of the decision values Si for the groups under 
consideration. As already mentioned, these decision values 
Si give a measure of how strongly the data of the 



6 



group i . This sum 



2 °u ' w u ** s then divided by the number 
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considered properties of the received unit match the 
corresponding data in units stored in group i. 

Fig, 4 shows a flow chart of the method in accordance with 
the present invention, said method being employed by the 
system described above. In a first step SI it is 
determined if a new transmission unit has arrived. If yes, 
then the values contained in the predetermined property 
fields of said new unit are determined in step S2, and 
compared with the values contained in the corresponding 
property fields of transmission units stored in the groups 
under consideration in step S3. Thus, occurrence values 
Oji for the occurrence of the value of the j-th property 
of the new unit in the i-th group are determined. Then, in 
step S4, these occurrence values are multiplied with 

p 

weights Wji and the sum ^O^-W^ is calculated, where P 

>=i 

represents the number of preselected properties. This sum 
is then divided by the number ni of units contained in the 
group i, to thereby determine the decision values Si for 
each group under consideration. Finally/ in step S5, the 
new transmission unit is automatically allocated to a 
group on the basis of the decision values Si, 

Due to this arrangement, the present invention provides a 
25 receiving and storing means for transmission units that 
can not only automatically allocate newly received units 
into groups, but which also automatically adapts its 
automatic allocation by performing the allocation on the 
basis of weighted sums that are calculated on the basis of 
30 the units contained in the groups under consideration. 

Therefore, the allocation decision is always automatically 
adapted to the momentary state of the considered groups. 
This makes the system very flexible and very efficient. 
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There is a variety of possibilities for using the decision 
values Si for allocating a received transmission unit to a 
group. According to one embodiment, the decision values Si 
are simply compared with one another, and the received 

5 message is then allocated to the group i that has the 
largest decision value Si. In this way, the received 
transmission unit is allocated to the group with which it 
statistically has the most in common. According to another 
embodiment, the largest and second largest decision value 

10 are determined, and the received transmission unit is 

stored in only the group associated with the largest value 
if the difference between the largest value and the second 
largest value exceeds a certain limit, and is stored in 
both the group associated with the largest value and the 

15 group associated with the second largest value in the 

event that the difference is smaller than said limit. This 
embodiment modifies the previous embodiment to thereby 
provide a system that allows the allocation into two (or 
more) groups if their respective decision values are close 

20 together . 



According to a preferred embodiment, the allocation 
decision is performed in accordance with the following 
method. First the largest of the decision values Si is 

25 determined, and then this value is compared with a 

threshold value T. If this largest value is larger than 
the threshold T, then the received transmission unit is 
stored into the group to which said largest decision value 
belongs. If not, then the received transmission unit is 

30 allocated to a predetermined default group, where said 
default group does not belong to the groups under 
consideration, i.e. no decision value Si is calculated for 
this default group. 
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The threshold T fulfills a double purpose. On the one hand 
it serves to prevent arbitrary decisions in the event that 
all decision values are relatively small, i.e. that the 
data in the property sections of the received transmission 
5 unit has little in common with any of the groups under 
consideration. On the other hand it serves as an 
adjustment parameter for adjusting how the frequently the 
automatic allocation function will be used. Consequently, 
it is preferable that T is a parameter that the user can 

10 set. If T is set to a relatively large value, then most 
received transmission units will be allocated to the 
default group, i.e. the frequency of automatic allocation 
to a group under consideration will be low. If T is set to 
a relatively low value, then the opposite effect occurs, 

15 i.e. an automatic allocation is performed often. The 

employment of the threshold T therefore provides great 
flexibility to the system. 

Naturally, the decision method of storing the received 
20 transmission unit in two groups if the corresponding 

decision values Si are close together can be combined with 
the above embodiment using the threshold value T, i.e. a 
received transmission unit is stored in two groups if the 
decision values belonging to those two groups are close 
25 together and both exceed the value T. 

As explained above, the present invention employs a number 
of parameters when determining the decision values Si. 
First there is the number of properties under 

30 consideration (e.g. "author" and "receiver" =2; or 

"author", "receiver" and "subject"=3) . Then there is the 
number of groups under consideration (e.g. 4 of 10 
possible groups) . Preferably, both of these parameters can 
be adjusted by the user, so that the user can customize 

35 the system to his personal needs and tastes. In other 
words, the user can adjust beforehand (i.e. before 
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enabling the automatic allocation process) which 
properties of the transmission units are to be taken into 
consideration, and into which groups should it be possible 
to conduct automatic allocation. 



Furthermore, the system of the invention uses the above 
mentioned weight factors Wji* In one embodiment, these 
weight factors can be permanent values, but preferably 
these factors are again user adjustable parameters (i.e. 
10 the user can adjust these parameters prior to enabling the 
automatic allocation process) or are also automatically 
adapted to the momentary state of the groups under 
consideration. This latter feature will now be described. 

15 In accordance with a preferred embodiment of the 

invention, the weight factors Wji used for calculating the 
above described weighted sum are determined on the basis 
of the average disorder regarding the occurrence of data 
associated with the properties under consideration. The 

20 average disorder of elements contained in a plurality of 
branches is generally defined as (see Winston, Patrick 
Henry, Artificial Intelligence, 3rd edition, Addison 
Wesley, 1992) : - 



where n b is the number of elements in branch b, n t is the 
total number of elements in all branches, and rib C is the 
number of elements in branch b of class c. The value 1 
30 represents total disorder, while the value 0 represents 
total order. The concept of disorder for statistical 
distributions is associated with the concept of 
information entropy for probability distributions. 



5 



25 
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In accordance with the present invention, this general 
equation is applied to the system at hand by calculating 
the weight factor Wji for the j-th property of the 
considered properties and the i-th group under 
5 consideration by considering all transmission units in 
group i as one class and the units in the other groups 
under • consideration as a second class. In other words, in 
the context of the present invention, there only exist two 
classes . 

10 

First, it must be determined how many different .values 
exist for the j-th property. As an example, if the j-th 
property is the property "author", then it must be 
determined how many different names (these are the values) 

15 are to be found in this category in the transmission units 
stored in all of the groups under consideration. Let us 
assume that 20 names occur, which will be denoted by the 
subscript k, i.e. namei, name^, name2o- The number 

of times that name* (which belongs to property j) occurs 

20 in group i is denoted as Qji (namek) , whereas the number of 
times that namek occurs in the other groups under 
consideration is denoted as NQji (namek) • The total number 
of units (in all groups under consideration) in which 
namek occurs, is denoted as Tji (namek), where naturally 

25 Tji(namek)= Qji(name k )+ NQji (name k ) . The total number of 

units contained in all of the groups under consideration 
is denoted as N. In the given example, i.e. with 20 
different names, N is given by the following equation 



Consequently, the disorder d j i of the j-th property 
(„author" in this example) with respect to the i-th group 
is given as: 



20 



30 
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Tj^name^ Q^name,) ( Q fi (name t ) } NQ f ,(name k ) ^ f NQ }i (name k ) 



N 



Yv 



T fl (name k ) "^T^name,,) ) T fi (name k ) l0g \ rename,) 



J) 



Therefore, if the j-th property in general has Kj 
different values, where said values are the contents of 
the j-th property fields of the stored units and are 
referred to as value*, then the disorder dji is generally 
determined by 



^ T^value^f Q fi (va!ue k ) ^ ( Q Rvalue k )\ NQ fi (value k ) ( NQ fl (value k )V 
* h N I T^value,)' ° g \ T,, (value k ) ) T fi (value k ) ' ° S \ T fi (value k ) ) } 



The terms Qji/Tji represent the relative occurrence of 
valuek in group i. If one of the numbers Qji (value*) or 
NQji (value*) equals 0, which means that all units 
containing value* are in group i or that no unit with 
value* is contained in group i, then the corresponding 
product 



Q .{value k ) f Q Rvalue k ) 
— iog 2 " — 



T fi (value k ) 



T^value,) J 



NQ . (value,) 

or - log 2 

T^value,) 



NQ Jt (value k ) 
{ Tj; (value k ) J 



is defined as 0, because the limit value lim(x logx) for 
x->0 is equal to 0 . In this way, all possible values in 
the equation determining dji are defined. 

According to the present invention, the weights Wji are 
defined by the equation 



\-d n 



j* p 
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The sum in the denominator runs over all of the selected 
properties (referred to as J to thereby avoid confusion 
with respect to the value dji in the numerator), i.e. from 
1 to P, where P is the number of selected properties. 

5 Consequently, the weights Wji reflect the average disorder 
associated with values of the j-th property with respect 
to the i-th group. For example, if there is a large degree 
of order with respect to a given group i, i.e. most of the 
values associated with property j are identical in said 

10 group, then the disorder dji is low and the weight factor 
Wji is large. Consequently, the j-th property becomes 
prominent in the determination of the decision value Si of 
the i-th group. On the other hand, if there is a large 
degree of disorder with respect to the j-th property (i.e. 

15 dji large) , then the influence of the j-th property on the 
decision value Si is smaller. 

Due to this preferred embodiment, the system becomes more 
flexible and more efficient, because not only are the 

20 parameters for performing the allocation determined 

automatically on the basis of the momentary situation in 
the groups under consideration, but the parameters for 
calculating the decision parameters are also automatically 
adapted to the momentary situation in the groups under 

25 consideration. 

According to a further preferred embodiment, the system is 
arranged such that a time of expiration function is 
included. This function causes the. system to automatically 

30 delete stored transmission units once the value in the 

field associated with the time or date property indicates 
that they have been stored for a predetermined amount of 
time. Preferably, this predetermined time is different for 
each individual group, and can be set by the user prior to 

35 enabling the system. At regular intervals, e.g. once a 
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day, the system then checks the value in the date field of 
each stored transmission unit, determines the difference 
to the momentary date, and deletes the unit if this 
difference exceeds the expiration value set for the group 
5 in which the unit is stored. For example, if the user has 
set an expiration time of one week for a given group, then 
the system will delete those transmission units in the 
group who are older than one week with respect to the 
momentary date. 

10 

This feature ensures that the number of stored 
transmission units can automatically be regulated and the 
system does not become overburdened with excessive amounts 
of stored units. Especially, this feature enhances the 

15 basic mechanism of the present invention, because it 

ensures that the allocation, which is determined by the 
momentary state of the system, is indeed adapted to the 
latest message streams. In other words, older messages, 
which are automatically deleted by the time of expiration 

20 feature, no longer influence the allocation of new 

messages, so that the system will more rapidly adapt to 
the characteristics of the current message streams. 
Consequently, this feature improves the automatic 
flexibility of the system. 

25 

As already mentioned above, the user can preferably adjust 
several parameters of the system prior to enabling the 
automatic allocation system. According to another 
preferred embodiment, this adjusting or setting of 

30 parameters is accomplished with the help , of a so called 

graphical user interface, an example of which is shown in 
Fig. 5. Such a graphical user interface can e.g. be 
displayed on a display means such as a CRT, and the 
adjustment can be performed with the help of standard 

35 input devices , such as a mouse and a keyboard. Such 
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display means and input means are well known in the art, 
so that no description needs to be given here. 

In the figure, it is possible to adjust the threshold 
value T for determining the automatic sorting activity, 
the group for which parameters are to be adjusted can be 
selected, and a time of expiration can be set. 
Furthermore, the weights associated with the predetermined 
number of properties for the selected group are displayed. 



This feature greatly enhances the usability of the system, 
as a user can quickly and safely configure the system to 



A best mode for employing the invention consists in a 
software implemented system for handling electronic mails 
or e-mails. This best mode embodiment will now be 
described. This system is installed on the computer of a 
user who is suitably connected to a transmission network, 
e.g. to the internet by a commercial provider. The system 
comprises a graphic user interface as described above, of 
which an example is shown in Fig. 5. Before activating the 
system, the user must create a desired number of folders 
into which new e-mails are to be automatically allocated 
by the system. In accordance with this best mode, the 
system will use all of these folders as the above 
mentioned groups under consideration and will 
automatically create a default folder (e.g. entitled 
"general in-box") . 

When receiving a new e-mail, the system extracts the 
values from a predetermined number (previously adjusted by 
the user with the help of the graphical user interface) of 
property fields in the message, compares them with the 
values in the corresponding property fields,, pf the 
messages stored in the folders under consideration (in 



10 



his personal needs and tastes. 
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this case all of the folders except the default folder) , 
determines the corresponding occurrence values Oji and 
then determines the decision values Si for each of the 
folders under consideration as described above in 
5 connection with Fig. 3. The weights Wji are automatically 
determined on the basis of the disorder dji, i.e. by 



as explained in detail above. 

10 The system then determines the maximum decision value S ma x 
and compares this value with a user defined threshold T. 
This threshold T is determined beforehand by the user, 
with the help of the graphical user interface, as e.g. 
shown in Fig. 5. If S ma x larger than T, then the e-mail is 

15 allocated to the folder to which S max belongs. If not, the 
e-mail is allocated to the default folder. 

The best mode embodiment furthermore comprises the above 
mentioned time of expiration feature, where the expiration 
20 time can be individually adjusted for each folder, as also 
indicated in Fig. 5. The feature functions as described 
above . 

This embodiment constitutes one possibility of combining 
25 the features from the above described preferred 
embodiments with the initially described general 
embodiment of the invention, it should however be noted 
that these individual features contained in the various 
preferred embodiments can be combined in any desirable way 
30 with the initially described general embodiment, in 

accordance with the specific requirements and individual 
preferences that a person skilled in the art may have when 
putting the present invention to practice. 
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The above described embodiments are to be seen as examples 
of the present invention. The invention is however by no 
means restricted to these examples, as variations and 
5 modifications will readily occur to those skilled in the 
art. Much rather, the present invention is defined by the 
scope. of the appended claims. Reference signs in the 
claims do not restrict the scope and only serve the 
purpose of better understanding. 
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Claims 



15 



20 



25 



30 



A receiving and storing means (1) for transmission 

its 

(TRU) containing information, each transmission unit 
having a format such that it contains a plurality of 
unit property fields (10-14) that each contain a 
value associated with a property of said transmission 
unit, comprising: 

- storage means (2) for storing transmission units, 
said storage means being arranged such that any 
transmission unit stored therein is allocated to one 
or more of a plurality of predetermined groups (Gi, 
G2 / * • - / G n ; 1 ) , 

- data determination means (3) arranged such that 

the respective values in a predetermined number 
of unit property fields of a transmission unit 
received by said receiving means are determined, 
and, 

for certain groups of said storage means, 
corresponding values in the same predetermined 
number of unit property fields of all 
transmission units in said certain groups are 
determined, 

- comparison means (4) for comparing said values from 
said received transmission unit with the values from 
said transmission units contained in said certain 
groups, 
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- comparison evaluation means (5) arranged such that, 
for each of said certain groups, the number of times 
(Oji) is counted that a value from a specific unit 
property field (j) of said received transmission unit 
5 occurs in the same specific unit property field (j) 

of transmission units stored in said group (i) , to 
thereby determine a value occurrence value (Oji) for 
said same specific unit property field (j) of said 
group (i), 



10 



- calculation means (6) arranged such that 



each value occurrence value (Oji) for a given 
unit property field (j) and given group (i) is 

15 multiplied with a multiplication factor (WjjJ 

that depends on said given unit property field 
(j) and said given group (i) , to thereby 
calculate a number of group product values that 
are -equal in number to said predetermined number 

20 of unit property fields, and, 



for each of said certain groups, said group 
product values are added together to a sum, said 
sum being divided by the number (ni) of 
25 transmission units in said group (i) , to thereby 

generate a storage decision value (Si) for each 
of said- certain groups, and 

- storage decision means (7) for deciding in which of 
30 all groups of said storage means to store said 

received transmission unit, on the basis of said 
storage decision values (Si) of said certain groups. 



2. A receiving and storing means according to claim 1, 
35 wherein storage decision means (7) are arranged such that 
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the storage decision values (Si) are compared and the 
largest value is determined, said received transmission 
unit being stored in the group associated with said 
largest value. 

5 

3, A receiving and storing means according to claim 2, 
wherein said storage decision means (7) are arranged such 
that also the second largest of said storage decision 
values (Si) is determined, the difference between said 

10 largest and said second largest value is determined, and 
if said difference is smaller than a predetermined limit, 
said received transmission unit is also stored in the 
group associated with said second largest storage decision 
value (Si) . 

15 

4. A receiving and storing means according to claim 1, 
wherein said storage decision means (7) is arranged such 
that the storage decision values (Si) are compared and the 
largest value is determined, said largest value being 

20 compared to a threshold value (T) that depends on the 
group associated with said largest value, wherein said 
received transmission unit is stored in said group 
associated with said largest value if said largest value 
exceeds said threshold value, and otherwise in a 

25 different, predetermined group. 



5. A receiving and storing means according to claim 4, 
wherein said storage decision means (7) is arranged such 
that also the second largest of said storage decision 
30 values (Si) is determined, the difference between said 

largest and said second largest value is determined, and 
if said difference is smaller than a predetermined limit 
and said second largest value also exceeds said threshold 
value (T) , said received transmission unit is also stored 
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in the group associated with said second largest storage 
decision value (Si) . 



6. A receiving and storing means according to claim 1, 
5 wherein said calculation means (7) is arranged to 

automatically calculate each multiplication factor (Wji) 
for the associated property field (j) and group (i) on the 
basis of the disorder (dji) of the values from said 
associated property field (j) with respect to the 
10 transmission units stored in said group (i) in comparison 
to the transmission units stored in the other groups of 
said certain groups. 

7. A receiving and storing means according to claim 6, 
15 wherein 

said data determination means (3) is arranged to 
determine, for each of said predetermined number (P) of 
property fields, the number Kj of different values valuer 

20 contained in the respective property fields of the 

transmission units stored in said certain groups, said 
different values value*, the number of times Qji (value*) 
that each of said values value* of each of said property 
fields (j) occurs in a given one (i) of said certain 

25 groups, the number of times NQji (value*) that each of said 
values value* of each of said property fields (j) occurs 
in the other groups of said certain groups besides said 
one given group, the total number Tj± of times that one of 
said values value* occurs in the transmission units stored 

30 in all of said certain groups, and the total number N of 
transmission units stored in all of said certain groups, 



said calculating means is arranged to calculate said 
disorder dji through the following equation: 
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N 



Q y< (value k ) 
Tj, (value,) 



log 



XT, 



(value k ) 
Rvalue k ) J 



NQji (value k ) ( NQ fl (value k )\ 
1 log 2 1 

Tj, (value k ) { T.,(value k ) ) 



where the term 



Q fi (yalue k ) ^ ( Q fi (value k ) y 
T., (value,)' ° S \ T fi (value k ) ) 



is set equal to zero if the value Qji (valuer) is equal to 
zero, and the term 



NQ fi (value k ) ^ ( NQ^value^ 
T^value,) ' °H Rvalue k ) ) 



is set equal to zero if the value NQji (value*) is equal to 
zero, and said calculation means is arranged to calculate 
said multiplication factor Wji associated with each of 
said property fields (j) and each of said groups (i) 
through the following equation 



\-d e 
W 1S = — '- 



where P is the number of predetermined property fields 



8. A receiving and storing means according to claim 1, 
wherein a display means and a control means for said 
display means are provided, and said display means is 
arranged to display a graphical user interface for 
entering parameters. 



9. A receiving and storing means according to claim 1, 
wherein a means is provided, which is arranged to 
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regularly determine a characteristic date for each of said 
transmission units stored in said certain groups, compare 
said characteristic date with the current date, and delete 
said transmission unit if the difference between said 
5 characteristic date and the current date exceeds a 
predetermined time limit. 

10. A communication system comprising: 

- a receiving and storing means according to one of claims 
10 1 to 9, and 

- a transmission means connected to said receiving and 
storing means, for transmitting transmission units. 

11. A communication system according to claim 10, wherein 
15 said transmission means is a data network for carrying 

electronic mail messages, and said transmission units are 
electronic mail messages. 

12. A method for receiving and storing transmission units 
20 (TRU) containing information, each transmission unit 

having a format such that it contains a plurality of unit 
property fields (10-14) that each contain a value 
associated with a property of said transmission unit, said 
transmission units being stored in a storage means, where 
25 said storage means is arranged such that any transmission 
unit stored therein is allocated to one or more of a 
plurality of predetermined groups (Gi, G2, • G n ; 
i ), comprising the steps of: 

30 determining the respective values in a predetermined 

number of unit property fields of a received transmission 
unit, and, 



for certain groups of said storage means, determining 
35 corresponding values in the same predetermined number of 
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unit property fields of all transmission units in said 
certain groups, 

comparing said values from said received transmission unit 
5 with the values from said transmission units contained in 
said certain groups, 

counting, for each of said certain groups, the number of 
times (Oji) that a value from a specific unit property 
10 field (j) of said received transmission unit occurs in the 
same specific unit property field (j) of transmission 
units stored in said group (i) , to thereby determine a 
value occurrence value (Oji) for said same specific unit 
property field (j) of said group (i), 

15 

multiplying each value occurrence value (Oji) for a given 
unit property field (j) and given group (i) with a 
multiplication factor (Wji) that depends on said given 
unit property field (j) and said given group (i) , to 
20 thereby calculate a number of group product values that 
are equal in number to said predetermined number of unit 
property fields, and, 

adding, for each of said certain groups, said group 
25 product values together to a sum, said sum being divided 
by the number (ni) of transmission units in said group 
(i), to thereby generate a storage decision value (Si) for 
each of said certain groups, and 

30 deciding in which of all groups of said storage means to 
store said received transmission unit, on the basis of 
said storage decision values (Si) of said certain groups. 

13. A method according to claim 12, wherein the storage 
35 decision values (Si) are compared and the largest value is 
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determined, said received transmission unit being stored 
in the group associated with said largest value. 



14. A method according to claim 13, wherein also the 
5 second largest of said storage decision values (Si) is 

determined, the difference between said largest and said 
second largest value is determined, and if said difference 
is smaller than a predetermined limit, said received 
transmission unit is also stored in the group associated 
10 with said second largest storage decision value (Si) . 

15. A method according to claim 12, wherein the storage 
decision values (Si) are compared and the largest value is 
determined, said largest value being compared to a 

15 threshold value (T) that depends on the group associated 
with said largest value, wherein said received 
transmission unit is stored in said group associated with 
said largest value if said largest-value exceeds said 
threshold value, and otherwise in a different, 

20 predetermined group. 

16. A method according to claim 15, wherein also the 
second largest of said storage decision values (Si) is 
determined, the difference between said largest and said 

25 second largest value is determined, and if said difference 
is smaller than a predetermined limit and said second 
largest value also exceeds said threshold value (T) , said 
received transmission unit is also stored in the group 
associated with said second largest storage decision value 

30 (Si) . 



17. A method according to claim 12, wherein each 
multiplication factor (Wji) for the associated property 
field (j) and group (i) is calculated on the basis of the 
35 disorder (dji) of the values from said associated property 
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field (j) with respect to the transmission units stored in 
said group (i) in comparison to the transmission units 
stored in the other groups of said certain groups. 



18 



A method according to claim 17, comprising the steps: 



determining, for each of said predetermined number (P) of 
property fields, the number Kj of different values value* 
contained in the respective property fields of the 
transmission units stored in said certain groups, said 
different values value*, the number of times Qji (value*) 
that each of said values value* of each of said property 
fields (j) occurs in a given one (i) of said certain 
groups, the number of times NQji (value*) that each of said 
values value* of each of said property fields (j) occurs 
in the other groups of said certain groups besides said 
one given group, the total number Tji of times that one of 
said values value* occurs in the transmission units stored 
in all of said certain groups, and the total number N of 
transmission units stored in all of said certain groups, 

calculating said disorder dji through the following 
equation: 



d ^ T^value k )( Q fi (value k ) ^ ( Q Rvalue k j \ NQ ;i (value k ) ^ ( NQ^value^ 
Ji h N { T^value,)' ° g \ (value k ) J T fi (value k ) ' °*\ T, (value k ) ) 



where the term 



O,, (value,) ( Q^lne^ 
T ji (value k ) 2 ^ T p (value k ) j 



is set equal to zero 
zero, and the term 



if the value Qji (value*) 



is equal to 
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NQ Rvalue,) 
T. Rvalue k ) 



f ' NQ Rvalue k ) 



\ Tjt (value k ) J 



is set equal to zero if the value NQji (valued is equal to 
5 zero, and calculating said multiplication factor Wji 

associated with each of said property fields (j) and each 
of said groups (i) through the following equation 



where P is the number of predetermined property fields. 

19* A method according to claim 12, wherein a graphical 
user interface is provided' for entering parameters being 
15 displayed on a display means. 

20. A method according to claim 12, comprising the steps 
of regularly determining a characteristic date for each of 
said transmission units stored in said certain groups, 
20 comparing said characteristic date with the current date, 
and deleting said , transmission unit if the difference 
between said characteristic date and the current date . 
exceeds a predetermined time limit, 

25 21. A method according to claim 12, wherein said 

transmission units are electronic mail messages which are 
transmitted over a data network for carrying electronic 
mail messages. 





p 



10 



30 



22. Computer program for the receipt and storage of 
transmission units (TRU) containing information, each 
transmission unit having a format such that it contains a 



WO 99/04353 PCT/EP98/04342 

32 

plurality of unit property fields (10-14) that each 
contain a value associated with a property of said 
transmission unit, said computer program being designed to 
store said transmission units in a storage means, where 
5 said computer program is designed such that any 

transmission unit stored stored in said storage means is 
allocated to one or more of a plurality of predetermined 
groups (Gi, G2, . .., G n ; i) , said computer program 
implementing a method comprising the steps of: 

10 

determining the respective values in a predetermined 
number of unit property fields of a received transmission 
unit, and, 

15 for certain groups of said storage means, determining 

corresponding values in the same predetermined number of 
unit property fields of all transmission units in said 
certain groups, 

20 comparing said values from said received transmission unit 
with the values from said transmission units contained in 
said certain groups, 

counting, for each of said certain groups, the number of 
25 times (Oji) that a value from a specific unit property 

field (j) of said received transmission unit occurs in the 
same specific unit property field (j) of transmission 
units stored in said group (i) , to thereby determine a 
value occurrence value (Oji) for said same specific unit 
30 property field (j) of said group (i), 

multiplying each value occurrence value (Oji) for a given 
unit property field (j) and given group (i) with a 
multiplication factor (Wji) that depends on said given 
35 unit property field (j) and said given group (i) , to 
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thereby calculate. a number of group product values that 
are equal in number to said predetermined number of unit 
property fields, and, 

5 adding, for each of said certain groups, said group 

product values together to a sum, said sum being divided 
by the number (n*) of transmission units in said group 
(i), to thereby generate a storage decision value (Si) for 
each of said certain groups, and 

10 

deciding in which of all groups of said storage means to 
store said received transmission unit, on the basis of 
said storage decision values (Si) of said certain groups. 

15 23. Computer program for receiving and storing electronic 
messages, said electronic messages having a format such 
that they contain specific fields that contain information 
associated with a property of the electronic message, 
where said computer program is designed to store said 

20 electronic messages in a storage means, such that any 
electronic message being stored is allocated to one or 
more' of a plurality of predetermined groups, said computer 
program being designed to implement a method on a computer 
running said computer program, where said method comprises 

25 the steps of: 

determining predetermined information associated with a 
property of a received electronic message, and, 

30 for certain groups, determining corresponding information 
associated with the same property for all electronic 
messages in said certain groups, 

comparing said information from said received electronic 
35 message with the information from said electronic messages 
contained in said certain groups, 



WO 99/04353 



34 



PCT/EP98/04342 



counting, for each of said certain groups, the number of 
times that an information associated with a specific 
property of said received electronic message appears in 
5 the electronic messages stored in said group, to thereby 
determine an occurence value that indicates the occurence 
of said information associated with a specific property 
for said group, 

10 multiplying each occurence value for a given property and 
given group with a multiplication factor that depends on 
said given property and said given group, to thereby 
calculate a number of group product values, 

15 adding, for each of said certain groups, said group 

product values together to a sum, said sum being divided 
by the number of electronic messages in said group, to 
thereby generate a storage decision value for each of said 
certain groups, and 

20 

deciding in which of all groups of said storage means to 
store said received electronic message, on the basis of 
said storage decision values of said certain groups. 

25 24. Computer program according to claim 23, wherein said 
electronic messages are electronic mails, and said groups 
are electronic folders for receiving said electronic 
mails . 

30 25. Computer program for receiving and storing electronic 
messages, said electronic messages having a format such 
that they contain specific fields that contain information 
associated with a property of the electronic message, 
where said computer program is designed to store said 

35 electronic messages in a storage means, such that any 
electronic message being stored is allocated to one or 
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more of a plurality of predetermined groups, said computer 
program being designed to implement a method on a computer 
running said computer program, where said method comprises 
the steps of: 



determining predetermined information associated with a 
property of a received electronic message, and, 

for certain groups, determining corresponding information 
10 associated with the same property for all electronic 
messages in said certain groups, 

comparing said information from said received electronic 
message with the information from said electronic messages 
15 contained in said certain groups, 

counting, for each of said certain groups, the number of 
times that an information associated with a specific 
property of said received electronic message appears in 
20 the electronic messages stored in said group, to thereby 
determine an occurence value that indicates the occurence 
of said information associated with a specific property- 
for said group, 

25 multiplying each occurence value for a given property and 
given group with a multiplication factor that depends on 
said given property and said given group, to thereby 
calculate a number of group product values, 

30 adding, for each of said certain groups, said group 

product values together to a sum, said sum being divided 
by the number of electronic messages in said group, to 
thereby generate a storage decision value for each of said 
certain groups, and 



5 
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deciding in which of all groups of said storage means to 
store said received electronic message, on the basis of 
said storage decision values of said certain groups, where 
the storage decision values are compared and the largest 
5 value is determined, said received electronic message 
being stored in the group associated with said largest 
value . 



26. Computer program for receiving and storing electronic 
10 messages, said electronic messages having a format such 

that they contain specific fields that contain information 
associated with a property of the electronic message, 
where said computer program is designed to store said 
electronic messages in a storage means, such that any 
15 electronic message being stored is allocated to one or 

more of a plurality of predetermined groups, said computer 
program being designed to implement a method on a computer 
running said computer program, where said method comprises 
the steps of: 

20 

determining predetermined information associated with a 
property of a received electronic message, and, 

for certain groups, determining corresponding information 
25 associated with the same property for all electronic 
messages in said certain groups, 

comparing said information from said received electronic 
message with the information from said electronic messages 
30 contained in said certain groups, 

counting, for each of said certain groups, the number of 
times that an information associated with a specific 
property of said received electronic message appears in 
35 the electronic messages stored in said group, to thereby 
determine an occurence value that indicates the occurence 
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of said information associated with a specific property 
for said group, 



multiplying each occurence value for a given property and 
5 given group with a multiplication factor that depends on 
said given property and said given group, to thereby 
calculate a number of group product values, 

adding, for each of said certain groups, said group 
10 product values together to a sum, said sum being divided 
by the number of electronic messages in said group, to 
thereby generate a storage decision value for each of said 
certain groups, and 



15 deciding in which of all groups of said storage means to 
store said received electronic message, on the basis of 
said storage decision values of said certain groups, where 
the storage decision values are compared and the largest 
value is determined, said largest value being compared to 

20 a threshold value that depends on the group associated 
with said largest value, where said received electronic 
message is stored in said group associated with said 
largest value if said largest value exceeds said threshold 
value, and otherwise in a different, predetermined group. 
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